Reinforcement learning - PDFSEARCH.IO - Document Search Engine

Reinforcement learning
Results: 1147

#	Item
511	Novel Writing Assignments in the Psychology of Learning John Kulig ~~ ~ Add to Reading List Source URL: wac.colostate.edu Language: English - Date: 2002-06-05 19:49:43 Mind Educational psychology Developmental psychology Reinforcement Psychology Learning Motivation E-learning Writing Across the Curriculum Behavior Education Behaviorism
512	Laurent Charlin – Curriculum Vitae 424 Rue St-Zotique Est Montreal, QC H2S 1L9 + Add to Reading List Source URL: www.cs.toronto.edu Language: English - Date: 2015-03-01 20:46:17 International Conference on Machine Learning Conference on Neural Information Processing Systems Reinforcement learning Partially observable Markov decision process Institute of Electrical and Electronics Engineers Artificial intelligence Machine learning Statistics
513	Policy Iteration for Learning an Exercise Policy for American Options Yuxi Li, Dale Schuurmans Department of Computing Science, University of Alberta Abstract. Options are important financial instruments, whose prices Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2008-09-08 12:22:02 Economics Autoregressive conditional heteroskedasticity Stochastic volatility Asian option TVR Reinforcement learning Normal distribution LSm Economic model Options Financial economics Statistics
514	Learning to Fire at Targets by an iCub Humanoid Robot Vishnu K. Nath and Stephen E. Levinson University of Illinois at Urbana-Champaign 405 North Mathews Avenue Urbana, IL 61801 Add to Reading List Source URL: www.isle.illinois.edu Language: English - Date: 2013-01-07 15:46:56 Markov models Computer vision Robotics ICub Humanoid robot Pi Reinforcement learning Robot Algorithm Mathematical analysis Mathematics Science and technology in Europe
515	Practical Issues in Temporal Diﬀerence Learning∗ Gerald Tesauro IBM Thomas J. Watson Research Center PO Box 704, Yorktown Heights, NYUSA Abstract. This paper examines whether temporal diﬀerence methods for t Add to Reading List Source URL: aass.oru.se Language: English - Date: 2005-06-14 12:26:47 Learning Neural networks Reinforcement learning Supervised learning Temporal difference learning Backgammon E-learning Algorithm Backpropagation Machine learning Computational neuroscience Games
516	Journal of Artificial Intelligence Research Submitted 3/13; publishedA Survey of Multi-Objective Sequential Decision-Making Diederik M. Roijers Add to Reading List Source URL: www.jair.org Language: English - Date: 2013-10-18 15:20:49 Systems theory Mathematical optimization Operations research Equations Stochastic control Reinforcement learning Markov decision process Bellman equation Policy Statistics Control theory Dynamic programming
517	Journal of Artificial Intelligence Research Submitted 01/05; published 1/06 Approximate Policy Iteration with a Policy Language Bias: Solving Relational Markov Decision Processes Add to Reading List Source URL: www.jair.org Language: English - Date: 2009-08-06 19:20:12 Stochastic control Reinforcement learning Markov decision process Valuation Policy Statistics Dynamic programming Markov processes
518	13. Reinforcement Learning Read Chapter 13] Exercises 13.1, 13.2, 13.4] Control learning Control policies that choose optimal actions Q learning Add to Reading List Source URL: aass.oru.se Language: English - Date: 2005-03-31 12:57:45 Stochastic control Q-learning Reinforcement learning Markov decision process Machine learning SARSA Statistics Dynamic programming Markov processes
519	Stable Dual Dynamic Programming Tao Wang Daniel Lizotte Michael Bowling Dale Schuurmans Department of Computing Science Add to Reading List Source URL: webdocs.cs.ualberta.ca Language: English - Date: 2007-10-21 19:53:48 Convex optimization Mathematical optimization Number theory Topological groups Markov decision process Linear programming Reinforcement learning Representation theory Μ operator Mathematics Algebra Operations research
520	Reverse Iterative Deepening for Finite-Horizon MDPs with Large Branching Factors Andrey Kolobov? Peng Dai† ∗ Mausam? Daniel S. Weld? Add to Reading List Source URL: www.cs.washington.edu Language: English - Date: 2013-08-13 03:53:46 Dynamic programming Systems theory Equations Operations research Stochastic control Markov decision process Reinforcement learning Bellman equation Automated planning and scheduling Statistics Mathematical optimization Control theory

UPDATE